Enable architecture selection for DPCTL_TARGET_CUDA
#2096
Deleted rendered PR docs from intelpython.github.com/dpctl, latest should be updated shortly. 🤞
Array API standard conformance tests for dpctl=0.21.0dev0=py310h93fe807_8 ran successfully.
Array API standard conformance tests for dpctl=0.21.0dev0=py310h93fe807_9 ran successfully.
Array API standard conformance tests for dpctl=0.21.0dev0=py310h93fe807_10 ran successfully.
Array API standard conformance tests for dpctl=0.21.0dev0=py310h93fe807_17 ran successfully.
Array API standard conformance tests for dpctl=0.21.0dev0=py310h93fe807_22 ran successfully.
Array API standard conformance tests for dpctl=0.21.0dev0=py310h93fe807_30 ran successfully.
LGTM
Array API standard conformance tests for dpctl=0.21.0dev0=py310h93fe807_34 ran successfully.
Array API standard conformance tests for dpctl=0.21.0dev0=py310h93fe807_37 ran successfully.
This PR proposes changing the `DPCTL_TARGET_CUDA` CMake option from a boolean to a string, allowing users to specify a CUDA architecture (e.g. `sm_80`). If not specified, it defaults to `sm_50`.

The specified architecture is used to construct a SYCL alias target (e.g. `nvidia_gpu_sm_80`), which is passed via the `-fsycl-targets` option, following oneAPI for NVIDIA GPUs.

Additionally, this PR removes the `DPCTL_TARGET_CUDA` env handling logic.
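
The mapping described above, from an architecture string to the flag handed to the compiler, can be sketched in a few lines. This is only an illustration in Python, not the CMake code from this PR: the names `DEFAULT_CUDA_ARCH` and `sycl_targets_flag` are hypothetical, and treating an empty value as "not specified" is an assumption.

```python
# Illustrative sketch only; the PR implements this logic in CMake.
# DEFAULT_CUDA_ARCH and sycl_targets_flag are hypothetical names.

DEFAULT_CUDA_ARCH = "sm_50"  # default stated in the PR description


def sycl_targets_flag(cuda_arch=None):
    """Build the -fsycl-targets flag for a CUDA architecture string.

    Assumption: None or an empty/blank string means "not specified",
    in which case the default architecture is used.
    """
    arch = (cuda_arch or "").strip() or DEFAULT_CUDA_ARCH
    # SYCL alias target, e.g. nvidia_gpu_sm_80
    alias = f"nvidia_gpu_{arch}"
    return f"-fsycl-targets={alias}"


if __name__ == "__main__":
    print(sycl_targets_flag("sm_80"))  # architecture given explicitly
    print(sycl_targets_flag())         # falls back to the default
```

With `sm_80` the sketch yields the `nvidia_gpu_sm_80` alias mentioned above; with no argument it falls back to `sm_50`.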